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\V This application is a continuation-in-part of USSN 08/208, 264, A (now pending) whteh is a 

^ x continuation-in-part of Ser. No. 08/179,904, filed January 11, 1994, both of which application are 

5 hereby incorporated by reference in their entireties. A 
BACKGROUND OF THE INVENTION 
Field of the Invention 

This invention relates to novel classes of proteins and protein analogues which bind to and inhibit 
human plasma kallikrein. 
10 Description of the Background Art 

Kallikreins are serine proteases found in both tissues and plasma. Plasma kallikrein is involved 
in contact-activated (intrinsic pathway) coagulation, fibrinolysis, hypotension, and inflammation. 
(See BH0092). These effects of kallikrein are mediated through the activities of three distinct 
physiological substrates: i) Factor XII (coagulation), ii) Pro-urokinase/plasminogen (fibrinolysis), 
15 and iii) Kininogens (hypotension and inflammation). 

Kallikrein cleavage of kininogens results in the production of kinins, small highly potent 
bioactive peptides. The kinins act through cell surface receptors present on a variety of cell types. 
Intracellular heterotrimeric G-proteins link the kinin receptors to second messenger pathways 
including nitric oxide, adenyl cyclase, phospholipase A 2 , and phospholipase C. Among the 
20 significant physiological activities of kinins are: (i) increased vascular permeability; (ii) 
vasodilation; (iii) bronchospasm; and (iv) pain induction. Thus, kinins mediate the life-threatening 
vascular shock and edema associated with bacteremia (sepsis) or trauma, the edema and airway 
hyperreactivity of asthma, and both inflammatory and neurogenic pain associated with tissue 
injury. The consequences of inappropriate plasma kallikrein activity and resultant kinin production 
25 are dramatically illustrated in patients with hereditary angioedema (HA). HA is due to a genetic 
deficiency of CI -inhibitor, the principal endogenous inhibitor of plasma kallikrein. Symptoms of 
HA include edema of the skin, subcutaneous tissues and gastrointestinal tract, and abdominal pain 
and vomiting. Nearly one-third of HA patients die by suffocation due to edema of the larynx and 
upper respiratory tract. Kallikrein is secreted as a zymogen (prekallikrein) that circulates as an 
30 inactive molecule until activated by a proteolytic event that frees the +NH 3 -IVGGTNSS... 
sequence of kallikrein (SEQ ID NO. 1). Human Plasma Prekallikrein is found in Genebank entry 




P03952. 

Nature plasma Kallikrein contains 619 amino acids. Hydrolysis of the Arg 37l -Ile 372 peptide 
bond yields a two-chain proteinase joined by a disulfide bond. The amino-terminal light chain 
(248 residues) carries the catalytic site. 

The main inhibitor of plasma kallikrein (pKA) in vivo is the CI inhibitor; see SCHM87, 
pp. 27-28. CI is a serpin and forms an essentially irreversible complex with pKA. Although 
bovine pancreatic trypsin inhibitor (BPTI) was first said to be a strong pKA inhibitor with = 
320 pM (AUER88), BERN93 indicates that its Kj for pKA is 30 nM (i.e., 30,000 pM). The G36S 
mutant had a of over 500 nM. Thus, there is a need for a safe kallikrein inhibitor. The essential 
attributes of such an agent are: 

i. Neutralization of relevant kallikrein enzyme(s); 

ii. High affinity binding to target kallikreins to minimize dose; 

iii. High specificity for kallikrein, to reduce side effects; and 

iv. High degree of similarity to a human protein to minimize potential 
immunogenicity and organ/tissue toxicity. i/j /r 

The candidate target kallikreins to be inhibited are ehymotrypin^homologou s serine proteases. 
Excessive Bleeding 

Excessive bleeding can result from deficient coagulation activity, elevated fibrinolytic activity, or 
a combination of the two. In most diatheses one must controll the activity of plasmin. However, 
plasma kallikrein (pKA) is an activator of plasminogen and a potent, selective pKA inhibitor may 
avert plasminogen activation. The clinically beneficial effect of BPTI in reducing blood loss is 
thought to result from its inhibition of plasmin (K D - 0.3 nM) or of plasma kallikrein (K D ~ 100 
nM) or both enzymes. It has been found, however, that BPTI is sufficiently antigenic that second 
uses require skin testing. Furthermore, the doses of BPTI required to control bleeding are quite 
high and the mechanism of action is not clear. Some say that BPTI acts on plasmin while others 
say that it acts by inhibiting plasma kallikrein. FRAE89 reports that doses of about 840 mg of 
BPTI to 80 open-heart surgery patients reduced blood loss by almost half and the me^amount 
^ transfused was decreased by 74%. Miles Inc. has recently introduced Trasylol in^USA for 
reduction of bleeding in surgery (See Miles product brochure on Trasylol, which is hereby 
incorporated by reference.) LOHM93 suggests that plasmin inhibitors may be useful in 
controlling bleeding in surgery of the eye. SHER89 reports that BPTI may be useful in limiting 




3 

bleeding in colonic surgery. 

A kallikrein inhibitor that is much more potent than BPTI and that is almost identical to 
a human protein domain offers similar therapeutic potential, allows dose to be reduced, and 
poses less potential for antigenicity. 
5 With recombinant DNA techniques, one may obtain a novel protein by expression of a 

mutated gene of a parental protein. Several strategies are known for picking mutations to test. 
One, "protein surgery", involves the introduction of one or more predetermined mutations within 
the gene of choice. A single polypeptide of completely predetermined sequence is expressed, and 
its binding characteristics are evaluated. 
10 At the other extreme is random mutagenesis by means of relatively nonspecific mutagens 

such as radiation and various chemical agents, see Lehtovaara, E.P. Appln. 285,123, or by 
expression of highly degenerate DNA. It is also possible to follow an intermediate strategy in 
which some residues are kept constant, others are randomly mutated, and still others are mutated 
in a predetermined manner. This is called "variegation". See Ladner, et al. USP 5,220,409. 
1 5 DENN94a and DENN94b report selections of Kunitz domains based on APP-I for binding 

to the complex of Tissue Factor with Factor VII a . They did not use LACI-K1 as parental and did 
not use pKA as a target. The highest affinityjpinder they obtained had K D for their target of about 
2 nM. Our first-round selectants founding to pKA have affinity of about 0.3 nM, and our 
second round selectants are about irc 0.1 nM (= 100 pM) or better. 
20 Proteins taken fro m a particular spe cies are assumed to be less likely to cause an immune 

response when injected into individuals of that species. Murine antibodies are highly antigenic in 
humans. "Chimeric" antibodies having human constant domains and murine variable domains are 
decidedly less antigenic. So called "humanized" antibodies have human constant domains and 
variable domains in which the CDRs are taken from murine antibodies while the framework of the 
25 variable domains are of human origin. "Humanized" antibodies are much less antigenic than are 
"chimeric" antibodies. In a "humanized" antibody, fifty to sixty residues of the protein are of non- 
human origin. The proteins of this invention comprise, in most cases, only about sixty amino acids 
and usually there are ten or fewer differences between the engineered protein and the parental 
protein. Although humans do develop antibodies even to human proteins, such as human insulin, 
30 such antibodies tend to bind weakly and the often do not prevenjthe injected protein from 
displaying its intended biological function. Using a protein frorr^kg, species to be treated does 




not guarantee that there will be no immune response. Nevertheless, picking a protein very close 
in sequence to a human protein greatly reduces the risk of strong immune response in humans. 

Kunitz domains are highly stable and can be produced efficiently in yeast or other host 
organisms. At least ten human Kunitz domains have been reported. Although BPTI was thought 
at one time to be a potent pKA inhibitor, there are, actually, no human Kunitz domains that 
inhibits pKA very well. Thus, it is a goal of this invention to provide sequences of Kunitz domain 
that are both potent inhibitors of pKA and close in sequence to human Kunitz domains. 

The use of site-specific mutagenesis, whether nonrandom or random, to obtain mutant 
binding proteins of improved activity, is known in the art, but does not guarantee that the mutant 
proteins will have the desired target specificity or affinity. Given the poor anti-kallikrein activity 
of BPTI, mutation of BPTI or other Kunitz domain proteins would not have been considered, 
prior to this invention, a preferred method of obtaining a strong binder, let alone inhibitor, of 
kallikrein. 

SUMMARY OF THE INVENTION 

This invention relates to novel BPTI-homologous Kunitz domains, especially LACI homologies, 
which inhibit one or more plasma (and/or tissue) kallikreins, and to the therapeutic and diagnostic 
use of these novel proteins. In particular, this invention relates to Kunitz domains derived from 
Kunitz domains of human origin and especially to the first Kunitz domain of LACI; Kunitz 
domains of human origin are likely to be non-immunogenic in humans. The proteins^of this 
invention inhibit plasma kallikrein (and/or tissue kallikrein) with a K D of no more than 20 nM, 
preferably, no more than 5 nM, more preferably, no more than about 300 pM, and most 
preferably, no more than about 100 pM. 

A specific, high affinity inhibitor of plasma kallikrein (and, where needed, tissue kallikrein) 
will demonstrate significant therapeutic utility in all pathological conditions mediated by kallikrein, 
and especially those associated with kinins. The therapeutic approach of inhibiting the catalytic 
production of kinins is considered preferable to antagonism of kinin receptors, since in the absence 
of kallikrein inhibition, receptor antagonists must compete with continuous kinin generation. 
Significantly, genetic deficiency of plasma kallikrein is benign and thus, inhibition of plasma 
kallikrein is likely to be safe. We have recently discovered a lead pKA inhibitor, designated 
KKH/3#6. This inhibitor is a variant of a naturally occurring human plasma protein Kunitz domain 




and demonstrates significantly greater kaUikrein binding potency than Trasylol. KKII/3#6 has a 
Kj for kaUikrein which is over 100 times that of both wild-type LACI and of BPTI, and is about 
300 pM. In contrast, its K< for plasmin is 10 uM. Proteins KK2/#1 1 and KK2/013 are especially 
preferred pKA inhibitors and have Kj < 300 pM and probably less than 100 pM. A reversible 
5 inhibitor is believed to be of greater utility than an irreversible inhibitor such as the CI inhibitor. 

Transfer of the subsequences that confer pKA binding into other Kunitz domains, 
particularly human Kunitz domains is disclosed. 

The preferred pKA inhibitors of the present invention fullfill one or more of the following 
desiderata: 

10 1) the inhibitor inhibits plasma kaUikrein with a K; no more than 20 nM, preferably 

5 nM or less, more preferably 300 pM or less, and most prefFerably 100 pM or 
less, . 

2) the inhibitor eempme-a Kunitz domain meeting the requirements shown in Table 
^ 1 4 with residues ftw3>er1>y reference to BPTI, 

15 3) the inhibitor has at the Kunitz domain positions 12-21 and 32-39 one of the 

amino-acid types listed for that position in Table 1 5, and 
4) the inhibitor is substantially homologous to a reference sequence of essentially 
human origin selected from the group KKII/3 #6, KK2/#1 1 , KK2/# 13, KK2/# 1 , 
KK2/#2, KK2/#3, KK2/#4, KK2/#6, KK2/#7, KK2/#8, KK2/#9, KK2/#10, 
20 KK2/#12, KK2conl, Human LACI-K2, Human LACI-K3, Human collagen a3 

KuDom, Human TFPI-2 DOMAIN 1, Human TFPI-2 DOMAIN 2, Tluman 
TFPI-2 DOMAIN 3, HUMAN ITI-K1, Human ITI-K2, HUMAN PROTEASE 
NEXIN-II, Human APP-I, DKM.2.1, DKM.3.1, DKI-2.1, DK.T-3.1.1, DKI- 
3.2. 1 , DKI-3.3 . 1, DKI-4. 1.1, DKI-4.2. 1 , DKI-4.2.2, DKI-5. 1, and DKI-6. 1 
25 NOMENCLATURE 

Herein, affinities are stated as K D (K D (AB)=[A][B]/[A-B]). A numerically smaller K D reflects 
higher affinity. For the purposes of this invention, a "kaUikrein inhibiting protein" is one that binds 
and inhibits a specified kallikxein with K; of about 20 nM or less. "Inhibition" refers to blocking 
the catalytic activity of kaUikrein and so is measurable in vitro in assays using chromogenic or 
30 fluorogenic substrates or in assays involving macromolecules. 

Amino-acid residues are discussed in three ways: full name of the amino acid, standard 
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three-letter code, and standard single-letter code. The text uses full names and three-letter code 
where clarity requires. 



A = Ala 


, G - Gly 


M = Met 


S = 5er 


C = Cys 


H = His 


N = Asn 


T = Thr 


D = Asp 


I = lie 


P = Pro 


V = Val 


E = Glu 


= Lys 


Q = Gin 


W = Trp 


F = Phe / 


L = Leu 


R = Arg 


Y = Tyr 



For the purposed of this invention, "substantially homologous" sequences are at least 5 1%, more 



preferably at least 80%, identical, over any Gpconioa regions. For this invention, "substantially 
homologous" includes exact identity. Sequences would still be "substantially homologous" if 
within one region of at least 20 amino acids they are sufficiently similar (51% or more) but outside 
the region of comparison they differed totally. An insertion of one amino acid in one sequence 
relative to the other counts as one mismatch. Most preferably, no more than six residues, other 
than at teiroinL, are different. Preferably, the divergence in sequence, particularly in the specified 
regions, is in the form of "conservative modifications". 
"Conservative modifications" are defined as 

(a) conservative substitutions of amino acids as defined in Table 9; and 

(b) single or multiple insertions or deletions of amino acids at termini, at domain boundaries, 
in loops, or in other segments of relatively high mobility. 

Preferably, except at termini, no more than about six amino acids are inserted or deleted at any 
locus, and the modifications are outside regions known to contain important binding sites. 
Kunitz Domains 

Herein, "Kunitz domain" and "KuDom" are used interchangeably to mean a homologue of BPTI 
(not of the Kunitz soya-bean trypsin inhibitor). A KuDom is a domain of a protein having at least 
51 amino acids (and up to about 61 amino acids) containing at least two, and preferably three, 
disulfides. Herein, the residues of all Kunitz domains are numbered by reference to BPTI 
(i.e. residues 1-58, amino-acid sequence in Table 2). Thus the first cysteine residue is residue 5 



and the last cysteine is 55. An amino-acid sequence shall, for the-ptH=peseif of this invention, be 
deemed a Kunitz domain if it can be aligned, with three or fewer mismatches, to the sequence 
shown in Table 14. An insertion or deletion of one residue shall count as one mismatch. In Table 



14, "x" matches any amino acid and "X M matches the types listed for that position. Disulfide^ 
bonds link at least two of: 5 to 55, 14 to 38, and 30 to 51. The number of disulfides may be 
reduced by one, but none of the standard cysteines shall be left unpaired. Thus, if one cysteine 
is changed, then a compensating cysteine is added in a suitable location or the matching cysteine 
is also replaced by a non-cysteine (the latter being generally preferred). For example, Drosophila 
fiinebris male accessory gland protease inhibitor has no cysteine at position 5, but has a cysteine 
at position -1 (just before position 1); presumably this forms a disulfide to CYS 55 . If Cys 14 and 
Cys 38 are replaced, the requirement of Gly 12 , (Gly or Ser) 37 , and Gly 36 are dropped. From zero 
to many residues, including additional domains (including other KuDoms), can be attached to 
either end of a Kunitz domain. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

Protease inhibitors, such as Kunitz domains, function by binding into the active site of the 
protease so that a peptide bond (the "scissile bond") is: 1) not cleaved, 2) cleaved very slowly, or 
3) cleaved to no effect because the structure of the inhibitor prevents release or separation of the 
cleaved segments. In Kunitz domains, disulfide bonds act to hold the protein together even if 
exposed peptide bonds are cleaved. From the residue on the amino side of the scissile bond, and 
moving away from the bond, residues are conventionally called PI, etc. Residues that 

follow the scissile bond are called PI', P2', P3\ etc. (SCHE67, SCHE^). It is generally accepted 
that each serine protease has sites (comprising several residues) SI, S2, etc. that receive the side 
groups and main-chain atoms of residues PI, P2, etc, of the substrate or inhibitor and sites ST, 
S2', etc, that receive the side groups and main-chain atoms of PT, P2', etc. of the substrate or 
inhibitor. It is the interactions between the S sites and the P side groups and main chain atoms 
that give the protease specificity with respect to substrates and the inhibitors specificity with 
respect to proteases. Because the fragment having the new amino terminus leaves the protease 
first, many worker designing small molecule protease inhibitors have concentrated on compounds 
that bind sites SI, S2, S3, etc, 

LASK80 reviews protein protease inhibitors. Some inhibitors have several reactive sites 
on one polypeptide chain, and these domains usually have different sequences, specificities, and 
even topologies. It is known that substituting amino acids in the P 5 to P 5 * region influences the 
specificity of an inhibitor. Previously, attention has been focused on the PI residue and those very 
close to it because these can change the specificity from one enzyme class to another. LASK80 
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suggests that among KuDoms, inhibitors with Pl=Lys or Arg inhibit trypsin, those with Pl=Tyr, 
Phe, Trp, Leu and Met inhibit chymotrypsin, and those with Pl=Ala or Ser are likely to inhibit 
elastase. Among the Kazal inhibitors, LASK80 continues, inhibitors with Pl=Leu or Met are 
strong inhibitors of elastase, and in the Bowman-Kirk family elastase is inhibited with Pl=Ala, but 
not with Pl-Leu. Such limited changes do not provide inhibitors of truly high affinity (i.e. better 
than 1 to 10 nM). 

KuDoms are defined above. The 3D structure (at high resolution) of BPTI (the archetypal 
Kunitz domain) is know^/ One of the X-ray structures is deposited in the Brookhaven Protein 
Data Bank as M 6P TI"]. T he 3D structure of some BPTI homologies (EIGE90, HYNE90) are 
known. At least seventy KuDom sequences are known. Known human homologues include three 
KuDoms of LACI (WUNT88, GIRA89, NOV089), two KuDoms of Inter- a -Trypsin Inhibitor, 
APP-I (KIDO88), a KuDom from collagen, and three KuDoms of TFPI-2 (SPRE94). 
LACI 

Lipoprotein-associated coagulation inhibitor (LACI) is a human serum phosphoglycoprotein with 
a molecular weight of 39 kDa (amino-acid sequence in Table 1) containing three KuDoms. We 
refer hereinafter to the protein as LACI and to the Kunitz domains thereof as LACI-K1 (residues 
50 to 107 ), LACI-K2 (residues 121 to 178), and LACI-K3 (213 to 270). The cDNA sequence 
of LACI is reported in WUNT88. GIRA89 reports mutational studies in which the PI residues 
of each of the three KuDoms were altered. LACI-K1 inhibits Factor Vila (F.VIIJ when F.VTI a 
is complexed to tissue factor a^d LACI-K2 inhibits Factor X a . It is not known whether LACI-K3 
inhibits anything. Neither LACI nor any of the KuDoms of LACI is a potent plasma kallikrein 
inhibitor. 

In one preferred embodiment of this invention, KuDoms are substantially homologous 
with LACI-K1, but differ in ways that confer strong plasma kallikrein inhibitory activity discussed 
below. Other KuDoms of this invention are homologous to other naturally-occurring KuDoms, 
particularly to other human KuDoms. For use in humans, the proteins of this invention are 
designed to be highly similar in sequence to one or another human KuDom to reduce the risk of 
causing an immune response. 

Variegation of a protein is typically achieved by preparing a correspondingly variegated 
mixture of DNA (with variable codons encoding variable residues), cloning it into suitable 
vectors, and expressing the DNA in suitable host cells. For any given protein molecule of the 
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library, the choice of amino acid at each variable residue, subject to the above constraints, is 
random, the result of the happenstance of which DNA expressed that protein molecule. 
FIRST LACI-K1 LIBRARY SCREENED FOR pKA BINDING 

Applicants have screened a first large library of LACI-K1 domains (patern of variegation is shown 
5 in Table 21), with the results shown in Table 3. In Table 3, "Library Residues" are those 
permitted to occur, randomly, at that position, in the library, and "Preferred Residues" are those 
appearing at that position in at least one of the 10 variants identified as binding to human 
kallikrein. 

At residues 13, 16, 17, 18, 31, and 32, the selections are very strong. At position 34, the 
10 selection for either SER or THR is quite strong. At position 39, the selection for GLY is strong. 
Position 19 seems to be rather tolerant. 

It should be appreciated that Applicants have not sequenced all of the positive isolates in 
this or other the libraries herein disclosed, that some of the possible mutant proteins may not have 
been present in the library in detectable amounts, and that, at some positions, only some of the 
15 possible amino acids were intended to be included in the library. 

SECOND LIBRARY OF LACI-K1 and SELECTION OF NEW KALLIKREIN 
INHIBITORS 

Applicants prepared a second LACI-K1 library as shown in Table 750. This library utilized the 
observation of the first selection and allows variability at positions 10, 11, 13, 15, 16, 17, 18, 19, 
20 and 21. The residues at positions 34 and 39 were fixed at S 34 and G 39 . Selectants KK2/#1 
through KK2/#13, as shown in Table 2/were obtained in the same manner as described in the 
Example section for the first screeneing Applicants prepared the proteins KK2/#1 1 and KK2/#13 
in S. cerevisiae in the Mata system described hereir^^reliminary measurements indicate that 
these proteins are very potent pKA inhibitors with Kj less than 300 pM and probably less than 100 
25 pM. * * 

Using the selected sequences and the binding data of selected KuDoms, we can write a 
recipe for a high-affinity pKA-inhibiting KuDom that can be applied to other human KuDom 
parentals. First, the KuDom must meet the requirements in Table 14. The substitutions shown 
in Table 1 5 are likely to confer high-affinity pKA inhibitory activity on any KuDom. Thus a 
30 protein that contains a sequence that is a KuDom, as shown in Table 14, and that contains at each 
of the position 12-21 and 32-39 an amino-acid type shown in Table 15 for that position is likely 
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to be a potent inhibitor of human pKA. More preferably, the protein would have an amino-acid 
type shown in Table 15 for all of the positions listed in Table 15. To reduce the potential for 
immune response, one should use one or another human KuDom as parental protein to give the 
sequence outside the binding region. 

5 It is likely that a protein that comprises an amino-acid sequence that is substantially 

homologous to one of KK2/#13, KK2/#1 1, or KKII/3#6 from residue 5 through residue 55 (as 
shown in Table 2) and is identical to one of KK2/#13, KK2/#1 1, or KKII/326 at positions 13-19, 
31, 32, 34, and 39 will inhibit human pKA with a of 5 nM or less. KK2/#13, KK2/#11, and 
KKII/3#6 differs from LACI-K1 at 10, 8 , and 7 positions respectively. It is not clear that these 

10 substitutions are equally important in fostering pKA binding and inhibition. From the known pKA 
inhibitors listed, one can prepare a series of molecules that are progressively reverted toward 
LACI-K1. It is expected that the molecules will show less affinity for pKA but also less potential 
for antigenicity. A person skilled in the art can pick a protein of sufficient potency and low 
immunogenicity from this collection. It is also possible that substitutions in one of the listed pKA 

15 inhibitors by amino acids that differ from LACI-K1 can reduce the immunogenicity without 
reducing the affinity for pKA to a degree that makes the protein unsuitable for use as a drug. 
DESIGNED KuDom PKA Inhibitors 

Hereinafter, "DKI" will mean a "Designed PKA Inhibitor" that are KuDoms that incorporate 
amino-acid sequence information from the SPI series of molecules, especially KK2/213, KK2/#1 1, 
20 or KKII/3#6. Sequences of several DKIs and their parental proteins are given in Table 2. 
Hereinafter, the statement " the mutations XnnYj, XnnY^ ... may not be needed" means that each 
of the mutations might be separately found to be unnecessary. That is, the list is not to be taken 
as a block to be applied together, but as a list of things to be tested. Similarly, the lists of 
additional mutations are to be tested singly. 

25 Protein DKI-1.2.1 is based on human LACI-K2 and shown in Table 2. The mutations 

PI 1G, I13R, Y17A, I18H, T19P, Y21W, R32E, K34S, and L39G are likely to confer high affinity 
for pKA. Some of these substitutions may not be necessary; in particular, PI 1G and T19P may 
not be necessary. Other mutations that might improve the pKA affinity include E9A, D10E, 
G16A, Y21F, andL39E. 

30 Protein DKI-1.3.1 (Table 2) is based on human LACI-K3. The mutations Rl ID, L13P, 

N17A, E18H, N19P, R31E, K34S, and S36G are intended to confer high affinity for pKA. Some 
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of these substitutions may not be necessary; in particular, N19P may not be necessary. Other 

changes that might improve K D include D10E, F21W and G39E. 

Protein DKI-2. 1 (Table 2) is a based on the human collagen a3 KuDom. The mutations 
\ 

D16A, F17A, I18H, R32E, and W34S are likely to confer high affinity for pKA. Some of these 



5 substitutions may not be necessary; in particular, R32E may no^e necessary. Other mutations 

that might improve the pKA affinity include K9A, D 1 0E^JD16£^K20R, R32T, W34V, and G39E. 

DKI-3.1.1 (Table 2) is derived from Human TFPI-2 domain 1. The exchanges Y11G, 

L17A, L18H, R3 IE, and L34S are likely to confer high affinity for pKA. The mutation L34S may 

not be needed. Other mutations that might foster pKA binding include Y21W, Y21F, Q32E, 

10 L34T, L34I, and E39G. 

DKI-S^.l^Table 2Vis derived from Human TFPI-2 domain 2. This parental domain 

contains insertions afte^esidue 9 (one residue) and 42 (two residues). The mutations E15R, 

G16A, S17A_T18H,E19P, K32T, and F34V are intended to confer affinity for pKA. If one needs 
— — r-r- nU 

a pKA inhibitor based on TFPI domain 2, a preferred route is to make a library of domains 
15 allowing^ne substitutions given and many others and then select binders. 

/dKI-3.3.1 (Table 2) is derived from human TFPI-2, domain 3. The substitutions L13H, 
1 5R, ji nd N17A are likely to confer high affinity for pKA. Other mutations that might foster 
pKA binding include D10E, T19Q, Y21W, T36G, and G39E. 

DKI-4. 1.1 (Table 2) is from human ITI-K1 by assertion of S10D, M15R.M17A, T18H, 
20 Q34S, and M39G. The mutations M39G and Q34V may not be necessary. Other mutations that - 
should foster pKA binding include: G16A, M17N, S19Q, Y21W, and Y21F. 

DKI-4.2.1 (Table 2) is from human ITI-K2 through the mutations V10D, Rl ID, F17A, 
I18H, V31E, L32E, P34S, and Q39E. The mutations V31E, L32E, and Q39E might not be 
necessary. Other mutation that should foster pKA binding include: V10E, Q19P, L20R, W21F, 
25 P34I, and Q39G. DKI-4.2.2 has eight mutations: V10D, R11D, F17A, I18H, L20R, V31E, 
L32E, and P34S. . 

DKI-5. 1 is derivednrom human APP-I (also known as Protease Nexin-II) by mutations 
M17A, I18H, ?s19P _ ^31F. an d P32E and is likely to be a potent pKA inhibitor. The mutations 
S19P, A3 IE, and P32E many not be needed. Other mutations that might foster pKA binding 
30 include Tl ID. 

DKI-6. 1 is derived from the HKI B9 KuDom (NORR93) by the five substitutions: Kl ID, 
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Q15R, T16A, M17A, M18H, T19P, and L32E. DKI-6.1 is likely to be a potent pKA inhibitor. 
The mutations L32E, and T19P might not be needed. 



Although BPTI is not an especially good pKA inhibitor, it could J>emade into one. DKI- 
7.1 is derived from BPTI by the mutations Y10E, K15R, R17A, RI18H, I 19P, Q31E, T32E, and 
5 R39E which is likely to increase the affinity for pKA. The mutations Y10E, K15R, I19P, Q31E, 
T32E, and R39E nflay not be needed; the really important mutations are R17A and RI18H. 



KuDoms are quite small; if this should cause a pharmacological problem, such as excessively 
quick elimination from circulation, two or more such domains may be joined. A preferred linker 

10 is a sequence of one or more amino acids, A preferred linker is one found between repeated 
domains of a human protein, especially the linkers found in human BPTI homologues, one of 
which has two domains (BALD85, ALBR83a, ALBR83b) and another of which has three 
(WUNT88). Peptide linkers have the advantage that the entire protein may then be expressed by 
recombinant DNA techniques. It is also possible to use a nonpeptidyl linker, such as one of those 

15 commonly used to form immunogenic conjugates. An alternative means of increasing the serum 
residence of a BPTI-like KuDom is to link it to polyethyleneglycol, so called PEGylation 
(DAVI79). 

WAYS TO IMPROVE SPECIFICITY OF, FOR EXAMPLE, KKH/3#7, KK2/#11, AND 
20 KK2/#13 FOR PLASMA KALLIKREIN: 

Because we have made a large part of *fie surface of KKII/3#6, KK2/#11, and KK2/#13 
complementary to the surface of pKA, Rjf is n ot essential for specific binding to pKA. Many of 
the enzymes in the clotting and fibrinolytic pathways cut preferentially after Arg or Lys. Not 
having a basic residue at the PI position may give rise to greater specificity. The variant 

25 KKn/3#7-K15A (shown in Table 27), having an ALA at PI, is likely to be a good pKA inhibitor 
and may have higher specificity for pKA relative to other proteases than doesKKII/3#7. The 
affinity of KKH/3#7-K15A for pKA is likely to be less than the affinity of KKII/3#7 for pKA, but 
the loss of affinity for other Arg/Lys-preferring enzymes is likely to be greater and, in many 
applications, specificity is more important than affinity. Other mutants that are likely to have good 

30 affinity and very high specificity include KK2/#13-R15A and KK2/#1 1-R15S. This approach 
could be applied to other high-affinity pKA inhibitors. 





MODIFICARION OF KUNITZ DOMAINS 




MODE OF PRODUCTION 



The proteins of this invention may be produced by any conventional technique, including 

(a) nonbiological synthesis by sequential coupling of component 
amino acids, 

(b) production by recombinant DNA techniques in a suitable host cell, 



(c) 




The proteins disclosed herein are preferably produced, recombinantly, in a suitable host, such as 
10 bacteria from the genera Bacillus, Escherichia, Salmonella, Erwinia, and yeasts from the genera 
Hansenula, Kluyveromyces, Pichia, Rhinosporidium, Saccharomyces, and Schizosaccharomyces, 
or cultured mammalian cells such as COS-1. The more preferred hosts are microorganisms of the 
species Pichia pastoris, Bacillus subtilis, Bacillus brevis, Saccharomyces cerevisiae, Escherichia 
coli and Yarrowia lipolytica. Any promoter, regulatable or constitutive, which is functional in 
15 the host may be used to control gene expression. 

Preferably the proteins are secreted. Most preferably, the proteins are obtained from 
conditioned medium. It is not required that the proteins described herein be secreted. Secretion 
is the preferred route because proteins are more likely to fold correctly, can be produced in 
conditioned medium with few contaminants, and are less likely to be toxic to host cells. Secretion 
20 is not required. 

Unless there is a specific reason to include glycogroups, we prefer proteins designed to 
lack N-linked glycosylation sites to reduce potential for antigenicity of glycogroups and so that 
equivalent proteins can be expressed in a wide variety of organisms including: 1) £. coli, 2) B. 
subtilis, 3) P. pastoris, 4) S. cerevisiae, and 5) mammalian cells. 

25 Several means exist for reducing the problem of host cells producing proteases that 

degrade the recombinant product; see, inter alia BANE90 and BANE91. VAND92 reports that 
overexpression of the B. subtilis signal peptidase in E. coli. leads to increased expression of a 
heterologous fusion protein. ANBA88 reports that addition of PMSF (a serine proteases inhibitor) 
to the culture medium improved the yield of a fusion protein. 

30 Other factors that may affect production of these and other proteins disclosed herein 

include: 1) codon usage (optimizing codons for the host is preferred), 2) signal sequence, 3) 
amino-acid sequence at intended processing , sites, presence and localization of processing 
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enzymes, deletion, mutation, or inhibition of various enzymes that might alter or degrade the 
engineered product and mutations that make the host more permissive in secretion (permissive 
secretion hosts are preferred). 

Reference works on the general principles of recombinant DNA technology include 
Watson et aL, Molecular Biology of the Gene, Volumes I and II, The Benjamin/Cummings 
Publishing Company, Inc., Menlo Park, CA (1987); Darnell et aL, Molecular Cell Biology, 
Scientific American Books, Inc., New York, N.Y. (1986); Lewin, Genes n, John Wiley & Sons, 
New York, N.Y. (1985); Old, et aL, Principles of Gene Manipulation: An Introduction to 
Genetic Engineering, 2d edition, University of California Press, Berkeley, CA (1981); Sambrook 
et aL, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, Cold 
Spring Harbor, NY (1989); and Ausubel et al t Current Protocols in Molecular Biology, Wiley 
Interscience, NY, (1987, 1992). These references are herein entirely incorporated by reference 
as are the references cited therein. 
PREPARATION OF PEPTIDES 

Chemical polypeptide synthesis is a rapidly evolving area in the art, and methods of solid phase 
polypeptide synthesis are well-described in the following references, hereby entirely incorporated 
by reference: (Menifield, J Amer Chem Soc 85:2149-2154 (1963); Merrifield, Science 
232:341-347 (1986); Wade et aL, Biopolymers 25:S21-S37 (1986); Fields, Int J Polypeptide 
Prot Res 35:161 (1990); MilUGen Report Nos. 2 and 2a, Millipore Corporation, Bedford, MA, 
1987) Ausubel et al, supra, and Sambrook et al, supra. Tan and Kaiser {Biochemistry V J977 \ 
16:1531-41) synthesized BPTI and a homologue eighteen years ago. 

As is known in the art, such methods involve blocking or protecting reactive functional 
groups, such as free amino, carboxyl and thio groups. After polypeptide bond formation, the 
protective groups are removed. Thus, the addition of each amino acid residue requires several 
reaction steps for protecting and deprotecting. Current methods utilize solid phase synthesis, 
wherein the C-terminal amino acid is covalently linked to an insoluble resin particles that can be 
filtered. Reactants are removed by washing the resin particles with appropriate solvents using an 
automated machine. Various methods, inclulding the "tBoc" method and the "Fmoc" method are 
well known in the art. See, inter alia, Atherton et aL, J Chem Soc Perkin Trans 1:538-546 
(1981) and Sheppard et aL, Int J Polypeptide Prot Res 20:451-454 (1982). 
ASSAYS FOR PLASMA KALLEKREIN BINDING AND INHIBITION 
Any suitable method may be used to test the compounds of this invention. Scatchard (Ann NY 
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Acad Sci (1949) 51:660-669) described a classical method of measuring and analyzing binding 
which is applicable to protein binding. This method requires relatively pure protein and the ability 
to distinguish bound protein from unbound. 

A second appropriate method of measuring K D is to measure the inhibitory activity against 
5 the enzyme. If the K D to be measured is in the 1 ruM to 1 jiM range, this method requires 
chromogenic or fluorogenic substrates and tens of micrograms to milligrams of relatively pure 
inhibitor. For the proteins of this invention, having K D in the range 5 nM to 50 pM, nanograms 
to micrograms of inhibitor suffice. When using this method, the competition between the inhibitor 
and the enzyme substrate can give a measured Kj that is higher than the true Measurempm 



10 reported here are not so corrected because the correction would be very small andrth^any 
correction would reduce the Here, we use the measured as a direct measure of K D . 

A third method of determining the affinity of a protein for a second material is to have the 
protein displayed on a genetic package, such as Ml 3, and measure the ability of the protein to 
adhere to the immobilized "second material". This method is highly sensitive because the genetic 

15 packages can be amplified. We obtain at least, semiquantitative values for the binding constants 
by use of a pH step gradient. Inhibitors of known affinity for the protease are used to establish 
standard profiles against which other phage-displayed inhibitors are judged. Any other suitable 
method of measuring protein binding may be used. 



20 preferably at most about 300 pM, and most preferably 100 pM or less. Preferably, the binding 
is inhibitory so that Kj is the same as K D . The Kj of KKH/3#6 is about 300 pM and the IQs of 
KK2/#1 1 and KK2/#13 are less than 300 pM and probably less than 100 pM. 
PHARMACEUTICAL METHODS AND PREPARATIONS 

The preferred subject of this invention is a mammal. The invention is particularly useful in the 
25 treatment of humans, but is suitable for veternary applications too. 

Herein, "protection" includes "prevention", "suppression", and "treatment". "Prevention" 
involves administration of drug prior to the induction of disease. "Suppression" involves 
administration of drug prior to the clinical appearance of disease. "Treatment" involves 
administration of drug after the appearance of disease, 
30 In human and veterinary medicine, it may not be possible to distinguish between 

"preventing" and "suppressing" since the inductive event(s) may be unknown or latent, or the 
patient is not ascertained until after the occurrence of the inductive event(s). We use the term 




Preferably, the proteins of this invention have a K D for pKA of at most about 5nM, more 
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"prophylaxis" as distinct from "treatment" to encompass "preventing" and "suppressing". Herein, 
V> "protection" includes "prophylaxis". Protection need not-fey absolute to be useful. 

Proteins of this invention may be administered, by any means, systemically or topically, 
to protect a subject against a disease or adverse condition. For example, administration of such 
5 a composition may be by any parenteral route, by bolus injection or by gradual perfusion. 
Alternatively, or concurrently, administration may be by the oral route. A suitable regimen 
comprises administration of an effective amount of the protein, administered as a single dose or 
as several doses over a period of hours, days, months, or years. 

The suitable dosage of a protein of this invention may depend on the age, sex, health, and 
10 weight of the recipient, kind of concurrent treatment, if any, frequency of treatment, and the 
desired effect. However, the most preferred dosage can be tailored to the individual subject, as 
is understood and determinable by one of skill in the art, without undue experimentation by 
adjustment of the dose in ways known in the art. 

For methods of preclinical and clinical testing of drugs, including proteins, see, e.g., 
15 Berkow et al, eds., The Merck Manual, 15th edition, Merck and Co., Rahway, N.J., 1987; 
Goodman etaL, eds., Goodman and Gilman's The Pharmacological Basis of Therapeutics, 
8th edition, Pergamon Press, Inc., Elmsford, N.Y., (1990); Avery's Drug Treatment: Principles 
and Practice of Clinical Pharmacology and Therapeutics, 3rd edition, ADIS Press, LTD., 
Williams and Wilkins, Baltimore, MD. (1987), Ebadi, Pharmacology, Little, Brown and Co., 
20 Boston, (1985), which references and references cited there are hereby incorporated by reference. 

In addition to a protein here disclosed, a pharmaceutical composition may contain 
pharmaceutical^ acceptable carriers, excipients, or auxiliaries. See, e.g., Berker, supra,, 
Goodman, supra, Avery, supra and Ebadi, supra. 
IN VITRO DIAGNOSTIC METHODS AND REAGENTS 
25 Proteins of this invention may be applied in vitro to any suitable sample that might contain plasma 
kallikrein to measure the pKA present. To do so, the assay must include a Signal Producing 
System (SPS) providing a detectable signal that depends on the amount of pKA present. The 
signal may be detected visually or instrumentally. Possible signals include production of colored, 
fluorescent, or luminescent products, alteration of the characteristics of absorption or emission 
30 of radiation by an assay component or product, and precipitation or agglutination of a component 
or product. 

The component of the SPS most intimately associated with the diagnostic reagent is called 
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the "label". A label may be, e.g., a radioisotope, a fluorophore, an enzyme, a co-enzyme, an 
enzyme substrate, an electron-dense compound, or an agglutinable particle. A radioactive isotope 
can be detected by use of, for example, a y counter or a scintillation counter or by 
autoradiography. Isotopes which are particularly useful are 3 H, 123 I, 13l I, 35 S, 14 C, and, preferably, 
125 I. It is also possible to label a compound with a fluorescent compound. When the fluorescently 
labeled compound is exposed to light of the proper wave length, its presence can be detected. 
Among the most commonly used fluorescent labelling compounds are fluorescein isothiocyanate, 
rhodamine, phycoerythrin, phycocyanin, allophycocyanin, ophthaldehyde, and fluorescamine. 
Alternatively, fluorescence-emitting metals, such as 125 Eu or other lanthanide, may be attached to 
the binding protein using such metal chelating groups as diethylenetriaminepentaacetic acid or 
ethylenediamine-tetraacetic acid. The proteins also can be detectably labeled by coupling to a 
chemiluminescent compound, such as luminol, isolumino, theromatic acridinium ester, imidazole, 
acridinium salt, and oxalate ester. Likewise, a bioluminescent compound, such as luciferin, 
luciferase and aequorin, may be used to label the binding protein. The presence of a 
bioluminescent protein is determined by detecting the presence of luminescence. Enzyme labels, 
such as horseradish peroxidase and alkaline phosphatase, are preferred. 

There are two basic types of assays: heterogeneous and homogeneous. In heterogeneous 
assays, binding of the affinity molecule to analyte does not affect the label; thus, to determine the 
amount of analyte, bound label must be separated from free label. In homogeneous assays, the 
interaction does affect the activity of the label, and analyte can be measured without separation. 

In general, a kallikrein-binding protein (KBP) may be used diagnostically in the same way 
that an anti-pKA antibody is used. Thus, depending on the assay format, it may be used to assay 
pKA, or, by competitive inhibition, other substances which bind pKA. 

The sample will normally be a biological fluid, such as blood, urine, lymph, semen, milk, 
or cerebrospinal fluid, or a derivative thereof, or a biological tissue, e.g., a tissue section or 
homogenate. The sample could be anything. If the sample is a biological fluid or tissue, it may 
be taken from a human or other mammal, vertebrate or animal, or from a plant. The preferred 
sample is blood, or a fraction or derivative thereof. 

In one embodiment, the pKA-binding protein (KBP) is immobilized, and pKA in the 
sample is allowed to compete with a known quantity of a labeled or specifically labelable pKA 
analogue. The "pKA analogue" is a molecule capable of competing with pKA for binding to the 
KBP, which includes pKA itself. It may be labeled already, or it may be labeled subsequently by 
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specifically binding the label to a moiety differentiating the pKA analogue from pKA. The phases 
are separated, and the labeled pKA analogue in one phase is quantified. 

In a "sandwich assay", both an insolubilized pKA-binding agent (KBA), and a labeled 
KBA are employed. The pKA analyte is captured by the insolubilized KBA and is tagged by the 
labeled KBA, forming a tertiary complex. The reagents may be added to the sample in any order. 
The KBAs may be the same or different, and only one KBA need be a KBP according to this 
invention (the other may be, e.g., an antibody). The amount of labeled KBA in the tertiary 
complex is directly proportional to the amount of pKA in the sample. 

The two embodiments described above are both heterogeneous assays. A homogeneous 
assay requires only that the label be affected by the binding of the KBP to pKA, The pKA 
analyte may act as its own label if a pKA inhibitor is used as a diagnostic reagent. 

A label may be conjugated, directly or indirectly (e.g., through a labeled anti-KBP 
antibody), covalently (e.g., with SPDP) or noncovalently, to the pKA-binding protein, to produce 
a diagnostic reagent. Similarly, the pKA binding protein may be conjugated to a solid phase 
support to form a solid phase ("capture") diagnostic reagent. Suitable supports include glass, 
polystyrene, polypropylene, polyethylene, dextran, nylon, amylases, and magnetite. The carrier 
can be soluble to some extent or insoluble for the purposes of this invention. The support material 
may have any structure so long as the coupled molecule is capable of binding pKA. 
IN VIVO DIAGNOSTIC USES 

A Kunitz domain that binds yery tightly to pKA can be used for in vivo imaging. . Diagnostic 
imaging of disease foci was considered one of the largest commercial opportunities for 
monoclonal antibodies, but this opportunity has not been achieved. Despite considerable effort, 
only two monoclonal antibody-based imaging agents have been approved. The disappointing 
results obtained with monoclonal antibodies is due in large measure to: 

i) Inadequate affinity and/or specificity; 

ii) Poor penetration to target sites; 

iii) Slow clearance from nontarget sites; 

iv) Immunogenicity (most are murine); and 

v) High production cost and poor stability. 

These limitations have led most in the diagnostic imaging field to begin to develop peptide-based 
imaging agents. While potentially solving the problems of poor penetration and slow clearance, 
peptide-based imaging agents are unlikely to possess adequate affinity, specificity and in vivo 
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stability to be useful in most applications. 

Engineered proteins are uniquely suited to the requirements for an imaging agent. In 
particular the extraordinary affinity and specificity that is obtainable by engineering small, stable, 
human-origin protein domains having known in vivo clearance rates and mechanisms combine to 
5 provide earlier, more reliable results, less toxicity/side effects, lower production and storage cost, 
and greater convenience of label preparation. Indeed, it should be possible to achieve the goal 
of realtime imaging with engineered protein imaging agents. Thus, a Kallikrein-binding protein, 
e.g., KKH/3#6, KK2/#11, and KK2/#13 may be used for localizing sites of excessive pKA 
activity. 

10 Radio-labelled binding protein may be administered to the human or animal subject. 

Administration is typically by injection, e.g., intravenous or arterial or other means of 
administration in a quantity sufficient to permit subsequent dynamic and/or static imaging using 
suitable radio-detecting devices. The dosage is the smallest amount capable of providing a 
diagnostically effective image, and may be determined by means conventional in the art, using 
15 known radio-imaging agents as guides. 

Typically, the imaging is carried out on the whole body of the subject, or on that portion 
of the body or organ relevant to the condition or disease under study. The radio-labelled binding 
. protein has accumulated. The amount of radio-labelled binding protein accumulated at a given 
point in time in relevant target organs can then be quantified. 
20 A particularly suitable radio-detecting device is a scintillation camera, such as a y camera. 

The detection device in the camera senses and records (and optional digitizes) the radioactive 
decay. Digitized information can be analyzed in any suitable way, many of which are known in 
the art. For example, a time-activity analysis can illustrate uptake through clearance of the radio- 
labelled binding protein by the target organs with time. 
25 Various factors are taken into consideration in picking an appropriate radioisotope. The 

isotope is picked: to allow good quality resolution upon imaging, to be safe for diagnostic use in 
humans and animals, and, preferably, to have a short half-life so as to decrease the amount of 
radiation received by the body. The radioisotope used should preferably be pharmacologically 
inert, and the quantities administered should not have substantial physiological effect. The binding 
30 protein may be radio-labelled with different isotopes of iodine, for example 123 I, l25 I, or l3I I (see, 
for example, U.S. Patent 4,609,725). The amount of labeling must be suitably monitored. 

In applications to human subjects, it may be desirable to use radioisotopes other than l25 I 
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for labelling to decrease the total dosimetry exposure of the body and t o optimize th e detectability 
of the labelled molecule. Considering ready clinical availability for use in humans, preferred radio- 
labels include: "To, 67 Ga, <*Ga, 1H In, U3m In, l23 I, l86 Re, m Re or 2ll At Radio-labelled protein 
may be prepared by various methods. These include radio-halogenation by the chloramine-T or 
lactoperoxidase method and subsequent purification by high pressure liquid chromatography, for 
example, see Gutkowska et al in "Endocrinology and Metabolism Clinics of America: (1987) 16 
(1):183. Other methods of radio-labelling can be used, such as IODOBEADS™. 

A radio-labelled protein may be administered by any means that enables the active agent 
to reach the agent's site of action in a mammal. Because proteins are subject to digestion when 
administered orally, parenteral administration, /.e., intravenous subcutaneous, intramuscular, 
would ordinarily be used to optimize absorption. 

High-affinity, high-specificity inhibitors are also useful for in vitro diagnostics of excess 
human pKA activity. 
Other Uses 

The kallikrein-binding proteins of this invention may also be used to purify kallikrein from 
a fluid, e.g., blood. For this, the KBP is preferably immobilized on a support. Such supports, 
include those already mentioned as useful in preparing solid phase diagnostic reagents. 

Proteins can be used as molecular weight markers for reference in the separation or 
purification of proteins. Proteins may need to be denatured to serve as molecular weight markers. 
A second general utility for proteins is the use of hydrolyzed protein as a nutrient source. Proteins 
may also be used to increase the viscosity of a solution. 

The proteins of this invention may be used for any of the foregoing purposes, as well as 
for therapeutic and diagnostic purposes as discussed further earlier in this specification. 



EXAMPLE 1: CONSTRUCTION OF FIRST LACI-K1 LIBRARY 

A synthetic oligonucleotide duplex having Nsil- and iW7wI-corapatible ends was cloned into 
a parental vector (LACLIII) previously cleaved with the above two enzymes. The resultant 
ligated material was transfected by electroporation into XLIMR^( F) Es cherichia coli strain and 
plated on Amp plates to obtain phage-generating Ap R colonies. The variegation scheme for Phase 
1 focuses on the PI region, and affected residues 13, 16, 17, 18 and 19. It allowed for 6.6 x 10 5 
different DNA sequences (3.1 x 10 5 different protein sequences). The library obtained consisted 
of 1.4 x 10 6 independent cfu's which is approximately a two fold representation of the whole 
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library. The phage stock generated from this plating gave a total titer of 1 .4 x 10 13 pfu's in about 
3.9 ml, with each independent clone being represented, on average, 1 x 10 7 in total and 2.6 x 10 6 
times per ml of phage stock. 

To allow for variegation of residues 31, 32, 34 and 39 (phase II), synthetic oligonucleotide 
5 duplexes withM/wI- and fi^/EII- compatible ends were cloned into previously cleaved Rf DNA 
derived from one of the following 

i) the parental construction, 

ii) the phase I library, or 

iii) display phage selected from the first phase binding to a given target. 

10 The variegation scheme for phase II allows for 4096 different DNA sequences (1600 different 
protein sequences) due to alterations at residues 3 1, 32, 34 and 39. The final phase II variegation 
is dependent upon the level of variegation remaining following the three rounds of binding and 
elution with a given target in phase I. 



15 sequences or 5.0 x 10 7 different protein sequences. When previously selected display phage are 
used as the origin of Rf DNA for the phase II variegation, the final level of variegation is probably 
in the range of 10 5 to 10 6 . 

Example 2: Screening of LACI (Kl) Library for Binding to Kallikrein 



20 incubation of the phage-display library with the kallikrein-beads of interest in a buffered solution 
(PBS containing 1 mg/ml BSA) followed by washing away the unbound and poorly retained 
display-phage variant with PBS containing 0.1% Tween 20. Kallikrein beads were made by 
coupling human plasma Kallikrein (Calbiochem, San Diego, CA, # 420302) to agarose beads 
using Reactigel (6x) (Pierce, Rockford, II, #202606). The more strongly bound display-phage are 

25- eluted with a low pH elution buffer, typically citrate buffer (pH 2.0) containing 1 mg/ml BSA, 
which is immediately neutralized with Tris buffer to pH 7.5. This process constitutes a single 
round of selection. 

The neutralized eluted display-phage can be either used: 

i) to inoculate an F + strain of E. coli to generate a new display-phage stock, to be used for 
30 subsequent rounds of selection (so-called conventional screening), or 

ii) be used directly for another immediate round of selection with the protease beads (so- 
called quick screening). 



The combined possible variegation for both phases equals 2.7 x 10 s different DNA 



The overall scheme for selecting a LACI-K1 variant to bind to a given protease involves 
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Typically, three rounds of either method, or a combination of the two, are performed to give rise 
to the final selected display-phage from which a representative number are sequenced and 
analyzed for binding properties either as pools of display-phage or as individual clones. 

Two phases of selection were performed, each consisting of three rounds of binding and 
5 elution. Phase I selection used the phase I library (variegated residues 13, 16, 17, 18, and 19) 
which went through three rounds of binding and elution against a target protease giving rise to 
a subpopulation of clones. The Rf DNA derived from this selected subpopulation was used to 
generate the Phase II library (addition of variegated residues 3 1, 32, 34 and 39). The 1. 8 x 10 7 
independent transformants were obtained for each of the phase II libraries. The phase II libraries 
10 underwent three further rounds of binding and elution with the same target protease giving rise 
to the final selectants. 

Following two phases of selection against human plasma kallikrein-agarose beads a 
number (10) of the final selection display-phage were sequenced. The amino-acid sequences are 
shown in Table 2, entries KBPconl through KKH/3#C. 
15 Table 23 shows that KkII/3(D) is a highly specific inhibitor of human Kallikrein. Phage 

that display the LACI-K1 derivative KkII/3(D) bind to Kallikrein beads at least 50-times more 
than it binds to other protease targets. 

Preliminary measurements indicate that KKII/3#6 is a potent inhibitor of pKA with 
probably less than 500 pM. 

20 

EXPRESSION, PURIFICATION AND KINETIC ANALYSIS, 

The three isolates KKII/3#6, KK2/#11, and KK2/#13 were recloned into a yeast expression 
vector. The yeast expression vector is derived from pMFalpha8 (KURJ82 and MIYA85). The 
LACI variant genes were fused to part of the mat a 1 gene, generating a hybrid gene consisting of 

25 the mata 1 promoter-signal peptide and leader sequence-fused to the LACI variant. The cloning 
site is shown in Table 24. Note that the correctly processed LAC^-Kl^riant ^xrqtein should be 
^ as detailed in Table 2 with the addition of residues glu-ala-ala-gl^otne N-terminal met (residue 
1 in Table 2). Expression in S. cerevisiae gave acceptable yield typical of this system. Yeast- 
expressed LACI (kunitz domain 1), BPTI and LACI variants: KKH/3#6, KK2/21 1, and KK2/#13 

30 were purified by affinity chromatography using trypsin-agarose beads. 

For larger-scale production, Pichia pastoris is a preferred host. The most preferred 
production sytem in P. pastoris is the alcohol oxidase system. Others have produced a number 
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of proteins in the yeast Pichia pastoris. For example, Vedvick et al. (VEDV9 1 ) and Wagner et 
al (WAGN92) produced aprotinin from the alcohol oxidase promoter with induction by methanol 
as a secreted protein in the culture medium at =1 mg/ml. Gregg et al. (GREG93) have reviewed 
production of a number of proteins in P. pastoris. Table 1 of GREG93 shows proteins that have 
5 been produced in P. pastoris and the yields. 

All references, including those to U.S. and foreign patents or patent applications, and to 
nonpatent disclosures, are hereby incorporated by reference in their entirety. 
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TABLE 1 : Sequence of whole LACI : 



15 5 
1 MIYTMKKVHA LWASVCLLLN 



.5 5 5 

LAPAPLNAcis eedeehtiit dteloolklM 



FFNIFTRQCE EFIYGGCEGN QNRFESLEEC 
kpdfcfleed pgicrgyitr yfynnqtkqc 



adrglcrane nrfyynsvig kcrpfkysgc 

fiqriskggl iktkrkrkkq rvkiayeeif 



10 



51 HSFCAFKADD GPCKAIMKRF 
101 KKMCTRDnan riikttlqqe 
151 erfkyggclg nmnnfetlee cknicedg pn gfqvdnygtq lnavnnsltp 
201 qstkvpslfe fhgpswcltp 
251 ggnennftsk qeclrackkg 
301 vknin (SEQ ID NO. 18) 

The signal sequence (1-28) is uppercase and underscored 
LACI-K1 is uppercase 
LACI-K2 is underscored 
LACI-K3 is bold 



TABLE 2 is below. 
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Table 3: Summary of first selection of LACI-K1 domains for 
binding to pKA. 


BPTT # 


(Lac I) 


Library Resides 


Preferred Residues 


13 


P 


LHPR 


HP 


16 


A 


AG 


AG 


17 


I 


FYLHINA 
SCPRTVD 
G 


NSA 


18 


M 


all 


HL 


19 


K 


LWQMKAG 
SPRTVE 


QLP 


31 


E 


EQ 


E 


32 


E 


V 

■ ,EQ 


EQ 


34 


I 


all 


STI 


39 


E 


all 


GEA 
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Tahle 2: Seauences of Kunitz domains, some of which inhibit human pKA. 




Went 


Mino-acid sequence 

1111111111222222222233333333334444444444555555555 

1234567890123456789012345678901234567890123456789012345678 


Ou\l IV INU. 


BPTI 


RPDFCLEPPYTGPCKMIIRYFYNAKAGLCQTFVYGGCRAKRNNFKSAEDCMRTCGGA 


qro ID NO 2 


LACHCl 


mhsfcafkaddgpckaimkrfffniftrqceefiyggcegnqnrfesleeckkmctrd 


SEO ID NO 3 


KBPconl 


mhsfcafkaddgHckaNHQrfffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 


qpn TD NO 4 


KKII/3#1 


mhsfcafkaddgHckASLPrfffniftrqcEEflyggcEgnqnrfesleeckkmctrd 


SEQ ID NO. 5 


KKII/312 


mhsfcafkaddgPckANHLrfffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 


SEQ ID NO. 6 


KKII/3//3 


rahsfcafkaddgHckANHQrfffniftrqcEEfTyggcGgnqnrresieecKKinctra 


SEQ ID NO. 7 


KKII/3#4 
KKII/3#5 


mhsfcafkaddgHckANHQrfffniftrqcEQfTyggcAgnqnrfesleeckkmctrd 
mhsfcafkaddgHckASLPrfffniftrqcEEflyggcGgnqnrfesleeckkmctrd 


SEQ ID NO. 8 
SEQ ID NO. 9 


KKII/3#6 


mhsfcafkaddgHckANHQrfffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 


SEO ID NO. 10 


KKII/3//7 


mhsfcafkaddgHckMHQrriinirtrqcLbibyggcbgnqnriebieeLKKiiitLLu 


SFO ID NO 11 


KKTT/3^8 


nihsfcafkaddgHckANHQrfffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 


SEQ ID NO. 12 


KKII/3/19 


mhsfcafkaddgHckANHQrfffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 


SEQ ID NO. 13 


KKII/3//10 


mhsfcafkaddgHckGAHLrfffniftrqcEEflyggcEgnqnrfesleeckkmctrd 


SEQ ID NO. 14 


KKI/3(a) 


mhsfcafkaddgRfkGAHLrfffniftrqceefiyggcegnqnrfesleeckkmctrd 


SEQ ID NO. 15 
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Table 2: Sequences of Kunitz domains, some of which inhibit human pKA, 



Mino-acid sequence 

1111111111222222222233333333334444444444555555555 

17^4567890123456789012345678901234567890123456789012345678 



KKI/3(b) 



mhsfcafkaddgPckAIHLrfffniftrqceefiyggcegnqnrfesleeckkmctrd 



SEQ ID NO. 



SEQ ID NO. 16 



KKII/3C 



mhsfcafkaddgHckANHQrfffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 17 



KK2/M 



mhsfcafkaDGgRcRGAHPrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



;eo ID NO. 19 



KK2/I14 



mhsfcafkaDGgRcRGAHPrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 20 



KM 
KK2/W 



mhsfcafkaDDgPcRMHPrWffniftr qcEEfSyggcGgnqnrfesleeckkmctrd 
rnhsfcafkaDDgPcRAAHPrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 21 



SEQ ID NO. 22 



KM 



mhsfcafkaDVgRcRGAHPrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 23 



KK2/04 



mhsfcafkaDVgRcRGAQPrFffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 24 



KK2/#6 



mhsfcafkaDDgScRAAHLrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 25 



raio 



mhsfcafkaEGgScRAAHQrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 26 



KK2///8 



mhsfcafkaDDgPcRGAHLrFffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ- ID NO. 27 



KK2///3 



mhsfcafkaDDgHcRGALPrHffniftrqcEEfSyggcGgnqnrfesleecktanctrd 



SEQ ID NO. 28 



mini 



mhsfcafkaPSgNcRGNLPrFffniftrqc EEfSyggcGgngnrfesleeckkmctrd 
mhsfcafkaDSqI\cRGNHQrFffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 



SEQ ID NO. 29 



SEQ ID NO. 30 
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TahlP 2: Seauences of Kunitz domains, some of which inhibit human pKA. 




Idcnt 


Amino-acid sequence 

1111111111222222222233333333334444444444555555555 

1234567890123456789012345678901234567890123456789012345678 


SEQ ID NO. 


KK2/#12 


mhsfcafkaDGgRcRAIQPrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 


nn a T T*\ VTA 0 1 

SEQ ID NO. 31 


KK2conl 


inhsfcafkaDDgRcRGAHPrWffniftrqcEEfSyggcGgnqnrfesleeckkmctrd 


SEQ ID NO. 32 


Human LACI-K2 


KPDFCFLEEDPGICRGYITRYFYNNdTKQCERFKYGGCLGNMNNFETLEECKNICEDG 


SEQ ID NO. 33 


DKI-1.2.1 


kpdfcfleedGgRcrgAHPrWfynnqtkqceEfSyggcGgnmnnfetleecknicedg 


SEQ ID NO. 34 


Human LACI-K3 


GPSWCLTPADRGLCRANENRFYYNSVIGKCRPFKYSGCGGNENNFTSKQECLRACKKG 


SEQ ID NO. 35 


DKI-1.3.1 


gpswcltpadDgPcraAHPrfyynsvigkcEpfSysgcggnennftskqeclrackkg 


SEQ ID NO. 36 


Human collagen a3 
KuDom 


ETDICKLPKDEGTCRDFILKWYYDPNTKSCARFWYGGCGGNENKFGSQKECEKVCAPV 


SEQ ID NO. 37 


DKI-2.I 


etdicklpkdegtcrAAHlkwyydpntkscaEfSyggcggnenkfgsqkecekvcapv 


SEQ ID NO. 38 


TFPI-2 DOMAIN 1 
DKI-3.1.1 


NAEICLLPLDYGPCRALLLRYYYDRYTQSCRQFLYGGCEGNANNFYTWEACDDACWRI 
naeicllpldGgpcraAHlryyydrytqscEqfSyggcegnannfytweacddacwri 


SEQ ID NO. 39 
SEQ ID NO. 40 


tfpi-2 DOMAIN 2 


VPKVCRLQVS- 

VDDQCEGSTEKYFFNLSSMTCEKFFSGGCHRNR- 

IENRFPDEATCMGFCAPK 

. ! " 


SEQ ID NO. 41 
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Table 2: Sequences of Kunitz domains, some of which inhibit human pKA. 




Ident 


flm-i nA-aPl H COrfllPflPP 

MinO.dLlU ocvjUcllLC 

1111111111222222222233333333334444444444555555555 
1234567890123456789012345678901234567890123456789012345678 


SEQ ID NO. 


DKI-3.2.1 


vpkvcrlqvs- 

vddqcRAAHPkyffnlssmtceEffsggchrnr- 

ienrfpdeatcingfcapk 


SEQ ID NO. 42 


TFPI-2 DOMAIN 3 


IPSFCYSPKDEGLCSANVTRYYFNPRYRTCDAFTYTGCGGNDNNFVbRLUCKKAlMA 


SEQ ID NO. 43 


DKI-3.3.1 


ipsfcyspkdegHcRaMQryyrnpryrtra^ 


SEQ ID NO. 44 


HUMAN ITI-K1 


KEDSCQLGYSAGPCMbMlbKiM^ v 


SEQ ID NO. 45 


DKI-4.1.1 


kedscqlgyDagpcRgAHPryfyngtsmacetfSyggcGgngnnfvtekeclqtcrtv 


SEQ ID NO. 46 


tt TTT V) 

Human ITI-K2 


TVMCNLPIVRGPCRAFIQLWAFDAVKGKCVLFPYGGCQGNGNKFYSEKECREYCGVP 


SEQ ID NO. 47 


DKM.2.1 


HraarnininnrmrraAHalwafdavkakcEEfSvaacEqnqnkfysekecreycgvp 


SEQ. ID NO. 48 


DKI-4.2.2 


L„,,„i n ; nn/Yn^raaHrfnwafHavVaVrFEfSvaacoflnonkfvsekecrevccivp 
tv&acnipiuugpcrannqAWaiua r 


SEQ ID NO. 49 


HUMAN 
PROTEASE 
NEXIN-II 


VREVCSEQAETGPCRMISRWYFDVTEGKCAPFFYGGCGGNRNNFDTEEYCMAVCGSA 


SEQ ID NO. 50 


DKI-5.1 


vrevcseqaetgpcraAHPrwyfdvtegkcEEfSyggcggnrnnfdteeycmavcgsa 


SEQ ID NO. 51 


HKIB9 domain 


LPNVCAFPMEKGPCQTYMTRWFFNFETGECELFAYGGCGGNSNNFLRKEKCEKFCKFT 


SEQ ID NO. 53 


DKI-6.1 


lpnvcafpmeDgpcRAAHPrwffnfetgeceEfayggcggnsnnflrkekcekfckft 


SEQ ID NO. 54 
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Table 2: Sequences of Kunitz domains, some of which inhibit human pKA. 


Ident 


Amino-acid sequence 

1111111111222222222233333333334444444444555555555 

1234567890123456789012345678901234561890123456789012345678 


SEQ ID NO. 


DKI-7.1 


rpdfcleppEtgpcRaAHPryfynakaglcEEfvyggcGakrnnfksaedcmrtcgga 


SEQ ID NO. 55 



Table 3 is above. 

TABLE 8: Binding Data for Selected Kallikrein-binding Display-Phage. 

5 Display-Phage (a) Fraction Bound (b) Relative Binding (c) 




LACI 

BPTI 

KKI/3(a) 

KKI/3(b) 

KKII/3H5 

KKII/316 



4.2 x 10" 6 

2.5 x 10" 5 

3.2 x 10" 3 

2.2 x lO -3 

3.9 x 10" 3 

8.7 x 10- 3 



1.0 
6.0 

761 
524 
928 
2071 



(a) Clonal isolates of display-phage. LACI-K1 is the parental molecule, BPTI (bovine pancreatic 
trypsin inhibitor) is a control and KKII/3 (5 and 6) and KKI/3(a and b) were selected by binding 

15 to the target protease, kallikrein. 

(b) The number of pfu's eluted after a binding experiment as a fraction of the input number (10 10 

pfu's). ! 

(c) Fraction bound relative to the parental display-phage, LACI-K1. 
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Table 9: Conservative and Semiconservative substitutions 



Initial 
AA type 


Category 


Conservative 
substitution 


Semi -conservative 
substitution 


A 


Small non- 
polar or 
slightly 
polar 


G, S, T 


N, V, P, (C) 




free SH 


A, M, L, V, I 


F, G 


C 


disulfide 


nothing 


nothing 


D 


acidic, 
hydrophilic 


E, N, S, T, Q 


K, R, H, A 


E 


acidic, 
hydrophilic 


D, Q, S, T, N 


K, R, H, A 


F 


aromatic 


W, Y, H, L, M 


1/ V, (C) 


G 


Gly-only 
conformation 


nothing 


nothing 


"normal" 
conformation 


A, S, N, T 


D, E, H, I, K, L, M, 
Q, R, V 


H 


amphoteric 
aromatic 


Y, F, K, R 


L, M, A, (C) 


I 


aliphatic, 
branched (3 
carbon 


V, L, M, A 


F, Y, W, G (C) 


K 


basic 


R, H 


Q, N, S, T, D, E, A 


L 


aliphatic 


M, I, V, A 


F, Y, W, H, (C) 


M 


hydrophobic 


L, I, V, A 


Q, F, Y, W, (C), (R), 
(K) , (E) 


XT 

N 


non-polar 
hydrophilic 


S, T, (D), Q, 
A, G, (E) 


K, R 


P 


inflexible 


V, I 


A, (C), (D), (E), F, 
H, (K) , L, M, N, Q, 
(R) , S, T, W, Y 


Q 


aliphatic 
plus amide 


N, E, A, S, T, 
D 


M, L, K, R 


R 


basic 


K, Q, H 


S, T, E, D, A, 


S 


hydrophilic 


A, T, G, N 


D, E, R, K 
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Table 9: Conservative and Semiconservative substitutions 


Initial 
AA type 


Category 


Conservative 
substitution 


Semi -conservative 
substitution 


T 


hydrophilic 


A, S, G, N, V 


D, E, R, K, I 


V 


aliphatic, 
branched (J 
carbon 


I, L, M, A, T 


P, (C) 


w 


aromatic 


F, Y, H 


L, M, I, V, (C) 


Y 


aromatic 


F, W, H 


L, M, I, V, (C) 



-5 ^ 



Changing from A, F, H, I, L, M, P, V, W, or Y to C is 
semiconservative if the new cysteine remains as a free thiol. 

Changing from M to E, R, K is semiconservative if the ionic 
10 tip of the new side group can reach the protein surface while 
the methylene groups make hydrophobic contacts. 

Changing from P to one of K, R, E, or D is semiconservative if 
the side group is on or near the surface of the protein. 

15 
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Table 14: 


Definition of a Kunitz Domain (SEQ ID HO. 52) 




1 




2 3 4 5 


12345 67 8 9012345 67 8 901234567 89012345 67 8 9012345 6789012345 67 8 


xxxxCxxxxxxGxCxxxxxxXXXxxxxxxCxxFxXXGCxXxxXxXxxxxxCxxxCxxx 


vvn ere . 








XI, 


X2, X3, X4, X58, X57 , and X56 may be absent, 


X21 




Phe, Tyr, Trp, 


X22 




Tyr or 


Phe, 


X2 3 




Tyr or 


Phe, 


X35 




Tyr or 


Trp, 


X36 




Gly or 


Ser , 


X40 




Gly or 


Ala, 


X43 




Asn or 


Gly, and 


X45 




Phe or 


Tyr 



Table 15: Substitution to confer high affinity for pKA on KuDoms 


Position 


Preferred 


Allowed 


Unlikely to work 


10 


Asp, Glu 


Ala, Gly, Ser, Thr 


Lys, Asn, (Arg, Cys, 
Phe, His, lie, Leu, 
Met, Pro, Gin, Val, 
Trp, Tyr) 


11 


Asp, Gly, Ser, Val 


Glu, Leu, Met, [Asn, 
He, Ala, Thr] 


(Cys, Phe, His, Lys, 
Pro, Gin, Arg, Trp, 
Tyr) 


12 


Gly 


(Other amino acids 
ONLYifC 14 -C 3s 
disulfide replaced by 
other amino acids.) 







Table 1 5 : Substitution to confer high affinity for pKA on KuDoms 


Position 


Preferred 


Allowed 


Unlikely to work 


13 


Arg, His, Pro, Asn, 
Ser 


[Thr, Ala, Gly, Lys, 
Gin] 


Phe, Tyr, Cys, Leu, 
He, Val, Asp (Glu, 
Met, Trp) 


14 


Cys 


(Other amino acids 
ONLYifC 3 g also 
changed.) 




15 


Arg, Lys 


[Ala, Ser, Gly, Met, 
Asn, Gin] 


(Cys, Asp, Glu, Phe, 
His, He, Leu, Pro, 
Thr, Val, Trp, Tyr) 


16 


Ala, Gly 


[Ser, Asp, Asn] 


(Cys, Glu, Phe, His, 
He, Lys, Leu, Met, 
Pro, Gin, Arg, Thr, 
Val, Trp, Tyr) 


17 


Ala, Asn, Ser, Be 


[Gly, Val, Gin, Thr 


Cys, Asp, Phe, His, 
Pro, Arg, Tyr, (Glu, 
Lys, Met, Trp) 


18 


His, Leu, Gin 


[Ala, 


Cys, Asp, Glu, Phe, 
Gly, He, Lys, Met, 
Asn, Pro, Arg, Ser, 
Thr, Val, Trp, Tyr 


19 


Pro, Gin, Leu 


[Asn, He] 


Ala, Glu, Gly, Met, 
Arg, Ser, Thr, Val, 
Trp, (Cys, Asp, Phe, 
His, Tyr) 


20 


Arg 


Leu, Ala, Ser, Lys, 
Gin, Val 


(Cys, Asp, Glu, Phe, 
Gly, His, He, Met, 
Asn, Pro, Thr, Trp, 
Tyr) 
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Table 15: Substitution to confer high affinity for pKA on KuDoms 


Position 


Preferred 


Allowed 


Unlikely to work 


21 


Trp, Phe 


[Tyr, His, He] 


Cys, Leu (Ala, Asp, 
Glu, Gly, Lys, Met, 
Asn, Pro, Gin, Arg, 
Ser, Thr, Val) 


31 


Glu 


[Asp, Gin, Asn, Ser, 
Ala, Val, Leu, He, 
Thr] 


(Arg, Lys, Cys, Phe, 
Gly, His, Met, Pro, 
Trp, Tyr) 


32 


Glu, Gin 


[Asp, Asn, Pro, Thr, 
Leu, Ser, Ala, Gly, 
Val] 


(Cys, Phe, His, lie, 
Lys, Met, Arg, Trp, 
Tyr) 


33 


Phe 


[Tyr] 


other 18 excluded. 


34 


Ser, Thr, lie 


[Val, Ala, Asn, Gly, 
Leu] 


Cys, Asp, Glu, Phe, 
His, Lys, Met, Pro, 
Gin, Arg, Trp, Tyr 


35 


Tyr 


[Trp, Phe] 


(other 17) 


36 


Gly 


Ser, Ala 


(other 17) 


37 


Gly 


(Other amino-acid 
types allowed only if 
C14-C38 replaced by 
other types.) 




38 


Cys 


(Other amino acids 
ONLY if C, 4 also 
changed.) 




39 


Gly, Glu, Ala 


[Ser, Asp] 


Other 15. 



Under "Preferred", most highly preferred type are bold 

Under "Allowed" are types not actually tested, but judged to be acceptable. Types shown in 
square brackets were allowed and not selected, but are so similar to types that were selected 





that the type is unlikely to abolish pKA binding. Such types are not preferred, but pKA- 
binding proteins could have such types. 

Under "Unlikely to work", types shown outside parentheses have been tried and no isolates 
had that type; types in parentheses have not been tested, but are judged to be unsuitable from 
5 consideration of the types actually excluded. 
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TABLE 21: First Variegation of LACI-K1 

abl234567 8 9 10 
37^Y AEMHS FCA.FKAD 

I gcc I gag | atg I cat I tc 
I Nsil I 



10 



15 



20 



L|P 

,D G H | R C. K A|G G D|G G R 
11 12 13 14 15 16 17 18 19 20 
I gat I ggt | cNt | tgt | aaa | gSt I NNt I NNS | NNg | cgt I 



4 
F 

ttcl 


5 6 
C A 
tgc I gcc I 




I IN 






C|H 




Fl S 


F | Y 




Y | C 


L|S 


LIS 


L|P 


W|P 


W|P 


H | R 


QIR 


QIR 


I I T 


M| T 


M|T 


N|V 


K|V 


K|V 


AID 


A|E 


A|E 


G 
17 


D|G 
18 


G 
19 



FFFNIFTRQC 
21 22 23 24 25 26 27 28 29 30 
I ttc | ttc | ttc | aac | ate | ttc | acg | cgt | cag | tgc | 
25 I Mlul 1 

Q | E N | H 

M| K C | I 

F|S F | Y 

30 Y|C LIS 

L|P W|P 

H | R QIR 

I|T M | T 

N|V K|V 

35 A|D A|E 

E | Q E | Q F G|W Y G G C G|D G N Q 
31 32 33 34 35 36 37 38 39 40 41 42 
I Sag I Saa | ttc | NNS I tac | ggt I ggt I tgt | NNS | ggt | aac | cag | 

I BstEII I 



<51 
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Table 21, continued 



# 



NRFESLEE 
43 44 45 46 47 48 49 50 
I aac | egg I ttc I gaa | tct I eta I gag I gaa 
I BstBI I I Xbal I 
I Agel I 



10 51 52 53 54 55 56 57 58 59 60 

. C K KMC T RDGA 
I tgt | aag I aag I atg I tgc | act | cgt | gac | ggc gec 

I KasI j 

15 The segment from Nsil to Mlul gives 65, 536 DNA sequences and 
31,200 protein sequences. Second group of variegation gives 
21, 840 and 32,768 variants. This variegation can go in on a 
fragment having Mlul and one of Agel, BstBI, or Xbal ends* 
Because of the closeness between codon 42 and the 3 1 

20 restriction site, one will make a self-priming 

oligonucleotide, fill in, and cut with Mlul and, for example, 

BstBI . Total variants are 2.726 x 10 9 and 8.59 x 10 9 . 

The DNA sequence has SEQ ID NO. 56. 

The amino acid sequence has SEQ ID NO. 57. 

""25 
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TABLE 23: 


Specificity 


Results 








Protein 
displayed 


Immobilized enzyme tested 


on M13 
glllp 


Plasmin 


Thrombin 


pKA 


Trypsin 


Trypsin, 
2 washes 


LACI-K1 


1 


1 


1 


1 


1 


KkII/3.(D) 


3.4 


1.5 


196. 


2. 


1.4 


BPTI 


88 . 


1.1 


1.7 


0.3 


.8 



35 Numbers refer to relative binding of phage display clones compared to the parental phage 
display. 

The KkII/3(D)(Kallikrein) clone retains the parental molecule's affinity for trypsin. KkII/3(D) 
was selected for binding to pKA. 
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TABLE 24 : Mat a S. CBrevlslae expression vectors : 
Matal (Mfa8) 

K R P R SEQ ID NO. 58 

5 ' - ... | AAA | AGG | CCT | CGA | G . . . - 3 1 SEQ ID NO. 59 

I StuI I 

I Xhol | 



Mata2 (after introduction of a linker into Stul-cut DNA) 

10 amino acids: SEQ ID NO. 60 

KREAAEPWGA. L E 

5 ' | AAA | AGG | GAA | GCG I GCC | GAG | CCA | TGG | GGC | GCC | TAA| TAG | CTC I GAG I 3 ' 

I EagI | 1 Styl \ KasI \ \ Xhol | 

DNA: SEQ ID NO. 61 

15 

Mata-LACI-Kl, amino acids: SEQ ID NO. 62, DNA: SEQ ID NO. 63 
abcd l2345678 
KREAAEMHS FC AFK 
5 • | AAA | AGG | GAA | GCG | GCC | GAG | atg | cat I tec | ttc | tgc | get | ttc | aaa | 
20 I EagI I I Nsil I 

9 10 11 12 13 14 15 16 17 18 19 20 
ADDGPCKAIMKR 
I get | gat | gaC I ggT | ccG | tgt | aaa | get I ate | atg I aaa | cgt I 
25 | RsrII | | BspKI \ 

21 22 23 24 25 26 27 28 29 30 
FFFNIFTRQC 
I ttc | ttc | ttc | aac | att | ttc I acG I cgt I cag I tgc | 
30 | Mlul | 

31 32 33 34 '35 36 37 38 39 40 41 42 
EE FIYGGCEGNQ 
I gag I gaAl ttC I att I tac I ggt I ggt I tgt I gaa I ggt I aac I cag | 
35 | EcoRI | | BstEII | 

43 44 45 46 47 48 49 50 
NRFESLEE 
I aac I cgG I ttc I gaa I tct I ctAl gag I gaa I 
40 I I BstBI I 1 Xbal. I 

1 Agel | 



51 52 53 54 55 56 57 58 59 60 
CKKMCTRDGA 
45 | tgt I aag I aag I atg I tgc I act I cgt I gac I ggc I gec I TAA | TAG I CTC I GAG | -3 • 

I KasI I I Xhol I 



We expect that Mator pre sequence is cleaved before GLUa-ALA,,- 



50 




• 
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Table 27 : High specificity plasma Kallikrein inhibitors 



10 



15 



20 



LACI-K1 (SEQ ID NO. 3) 

MHS FCAFKADDGPCKAIMKRFFFNI FTRQCEEFI YGGCEGNQNRFESLEECKKMCTRD 
KKII/3#7 (SEQ ID NO. 11) 

mhsf caf kaddgHckANHQrf f fnif trqcEEf SyggcGgnqnrf esleeckkmctrd 
KKII/3#7-K15A (SEQ ID NO. 64) 

mhsf caf kaddghcAanhqrf f fnif trqceef syggcggnqnrf esleeckkmctrd 
KK2/#13-R15A (SEQ ID NO. 65) 

mhsf caf kaDGgRcAGAHPrWf fnif trqcEEf SyggcGgnqnrf esleeckkmctrd 
KK2/#11-R15S (SEQ ID NO. 66) 

mhsf caf kaddgpcSaahprwf fnif trqceef syggcggnqnrf esleeckkmctrd 
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Table 49 



10 



15 



20 



Residue Number 





10 


11 


12 


13 


14 


15 


16 


17 


18 


19 


20 


_21_ 
F 


LACI-K1 


D 


D 


G 


p 


C 


K 


A 


I 


M 


K 


R 


Consensus of 
KKII/ J 

selectants 


a 


a 


g 


T_T 

n 


c 


K 


A 


N 


H 


Q 


r 


f 


KK Library #2 


NK 
DE 


NI 
AD 
ST 
GV 


g 


FS 
YC 
LP 
HR 
IT 
NV 
AD 
G 


c 
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Table 750: DNA that embodies Library KKF. 



K 

M H S F C A F K A D|E 



i / 1 2 3 4 5 6 7 8 910 

/ 5'-cctcct atg cat tec ttc tgc gec ttc aag get RaS 

' | Nsil 1 

F| S 

10 Y|C 

L | H 
Rl I 

I|V V|T V|T (M) (K) 

T | A A|G A|S L|P L|P 

15 S |G D|N G I D Q | R R | H 

D|N G P C K | R A|G I|N H Q R 
11 12 13 14 15 16 17 18 19 20 
RNt ggt NNt tgt aRa gSt RNt cNS cNS cgt 

20 W|L 

F|C F F N I F T R (SEQ ID NO. 69) 

21 22 23 24 25 26 27 28 

tKS ttc ttc aac ate ttc acg cgt tccctcc-3' (SEQ ID NO. 67) 

3'-g ttg tag aag tgc gca agggagg-5' (SEQ ID NO. 68) 

25 | M2ul | 

The Rsrll and BspKl sites found in the parental LACI-K1 
display gene (Table 6) are not present in Library KKF. 

30 There are 1,536,000 amino-acid sequences and 4,194,304 DNA 
sequences. Met 18 and Lys 19 are not allowed in Library KKF. 
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